I. Information-Theoretic Inequalities.
ABSTRACT: This paper is generally concerned with understanding how the uncertainty
principle arises in formulations of quantum mechanics, such as the decoherent
histories approach, whose central goal is the assignment of probabilities to histories. We
first consider histories characterized by position or momentum projections at two moments
of time. Both exact and approximate (Gaussian) projections are studied. Shannon's
information is used as a measure of the uncertainty expressed in the probabilities for these
histories. We derive a number of inequalities in which the uncertainty principle is expressed
as a lower bound on the information of phase space distributions derived from the
probabilities for two-time histories. We go on to consider histories characterized by position
samplings at $n$ moments of time. We derive a lower bound on the information of the joint
probability for $n$ position samplings. Similar bounds are derived for histories characterized
by samplings of other variables. All lower bounds on the information of histories have the
general form $\ln(V_H/V_S)$, where $V_H$ is a volume element of history space, which we define,
and $V_S$ is the volume of that space probed by the projections. We thus obtain a concise
and general form of the uncertainty principle referring directly to the histories description
of the system, and making no reference to notions of phase space.
I. INTRODUCTION



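A quantum-mechanical history is defined by an initial quantum state at some time
$t_0$, and by a sequence of propositions at a succession of times $t_1, t_2, \ldots, t_n$. The initial
state is represented by a density matrix $\rho$. Each proposition is represented by a set of
projection operators $P_\alpha$. These are positive hermitian operators that are both exclusive
and exhaustive:

$$ P_\alpha P_\beta = \delta_{\alpha\beta} P_\alpha, \tag{1.1} $$

$$ \sum_\alpha P_\alpha = 1. \tag{1.2} $$

Evolution between each projection is described by the unitary evolution operator, $e^{-iHt/\hbar}$.
The probability for histories described in this way is given by the expression,

$$ p(\alpha_1, \alpha_2, \ldots, \alpha_n) = \mathrm{Tr}\left[ P^n_{\alpha_n}(t_n) \cdots P^1_{\alpha_1}(t_1)\, \rho\, P^1_{\alpha_1}(t_1) \cdots P^n_{\alpha_n}(t_n) \right] \tag{1.3} $$

where

$$ P^k_{\alpha_k}(t_k) = e^{iH(t_k - t_0)/\hbar}\, P^k_{\alpha_k}\, e^{-iH(t_k - t_0)/\hbar}. \tag{1.4} $$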

Eq.(1.3) is central to any formulation of quantum mechanics whose aim is the assign- 
ment of probabilities to histories. One particular such approach is the decoherent histories 
approach [1,2,3,4,5]. In that approach, the central aim is to find, for a given Hamiltonian 
and initial state, the sets of histories of closed quantum systems for which the probabilities 
(1.3) satisfy the so-called "probability sum rules". Loosely, these are the rules obtained 
by demanding that the probability of a composite history is the sum of the probabilities 
of the more elementary histories of which it is comprised. An example of such a sum rule 
(of which there are many) is, 

$$ p(\cdots \alpha_{k-1}, \alpha_{k+1} \cdots) = \sum_{\alpha_k} p(\cdots \alpha_{k-1}, \alpha_k, \alpha_{k+1} \cdots) \tag{1.5} $$

Histories satisfying these rules are said to be "consistent", or "decoherent", and it is solely
in terms of such histories that predictions may be made. Quantum-mechanical interfer- 
ence means that these rules are generally not satisfied, and demonstrating consistency is 
typically non-trivial. 
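
The failure of a sum rule such as (1.5) can be seen in a very small example. The sketch below is an illustration only and is not taken from the paper: it assumes a two-level system, trivial evolution ($H = 0$), a projection onto the $z$ basis at the first time and onto the $x$ basis at the second, and the initial state $|{+}x\rangle$. The helper names (`matmul`, `projector`, `history_probability`) are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def projector(vec):
    """|v><v| for a normalised two-component complex vector."""
    return [[vec[i] * vec[j].conjugate() for j in range(2)] for i in range(2)]

s = 1.0 / math.sqrt(2.0)
P_z = [projector([1 + 0j, 0j]), projector([0j, 1 + 0j])]          # alternatives at t1
P_x = [projector([s + 0j, s + 0j]), projector([s + 0j, -s + 0j])]  # alternatives at t2
rho = P_x[0]                                                       # initial state |+x><+x|

def history_probability(a1, a2):
    """Tr[P2 P1 rho P1 P2], the form (1.3), assuming H = 0 between the projections."""
    chain = matmul(P_x[a2], P_z[a1])
    chain_dagger = [[chain[j][i].conjugate() for j in range(2)] for i in range(2)]
    return trace(matmul(matmul(chain, rho), chain_dagger)).real

p = {(a1, a2): history_probability(a1, a2) for a1 in (0, 1) for a2 in (0, 1)}
coarse = trace(matmul(P_x[0], rho)).real   # history with no projection at t1
fine_sum = p[(0, 0)] + p[(1, 0)]           # right-hand side of (1.5)

print(p)
print("coarse-grained:", coarse, "sum over alpha_1:", fine_sum)
```

In this example the four candidate probabilities are non-negative and sum to one, yet the coarse-grained probability differs from the sum over the intermediate alternative, so this set of histories is not consistent.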

The formula (1.3) also arises in a different context. It is a concise summary of the 
Copenhagen approach to the quantum mechanics of measured subsystems. It incorporates 
both the unitary evolution of states together with the "collapse of the wave function" 
incurred as a result of measurement by an external agency, modeled by the projection
operators [6,7].

Irrespective of which interpretational scheme one is concerned with, the mathematical 
properties of the expression (1.3) are of interest. This paper is concerned with exploring 
those properties. 

Our particular concern is the question of how the uncertainty principle arises in (1.3). 
The usual form,

$$ \Delta p\, \Delta q \ge \frac{\hbar}{2} \tag{1.6} $$

is a simple consequence of the Fourier transform of the wave function of the system at a fixed
moment of time. However, in formulations that give a central role to (1.3), the state of
the system at a fixed moment of time does not enter in a fundamental way.
Instead, all physically meaningful notions must be expressed through the probabilities 
(1.3). It therefore becomes an important issue to understand how these probabilities 
recognize the uncertainty principle. It is not difficult to see that it will arise as a limitation 
on the degree to which (1.3) may be peaked about a particular history. This is because 
the probability (1.3) is a distribution over quantities that are generally non-commuting, so 
one would not expect it to become arbitrarily peaked. The aim of this paper is to establish 
the detailed form this limitation takes. 
As a measure of the degree to which (1.3) is peaked, we shall use the Shannon information:

$$ I = -\sum_{\alpha_1 \cdots \alpha_n} p(\alpha_1, \ldots, \alpha_n) \ln p(\alpha_1, \ldots, \alpha_n) \tag{1.7} $$

This measure, for histories, is in many ways more natural and easier to use than the
variances employed in (1.6). We shall show that the uncertainty principle generally arises as a
lower bound on the information (1.7). In particular, for the case in which the alternatives
$\alpha_k$ are discrete, the probabilities (1.3) have an upper bound $p_{\max}$, over all initial states $\rho$
and over all possible values of the alternatives. If there is a restriction on the degree to
which (1.3) is peaked, as one would expect when the projections do not commute, then
$p_{\max} < 1$. The information (1.7) then has a non-trivial lower bound

$$ I \ge \ln\left(\frac{1}{p_{\max}}\right) \tag{1.8} $$

We begin in Section II with a brief review of some properties of Shannon information. 
We then go on in Section III to discuss information-theoretic measures of uncertainty 
in quantum mechanics. We review earlier work on information-theoretic versions of the 
uncertainty principle, expressed in terms of the state of the system at a fixed moment of 
time. 

In Sections IV and V we discuss quantum-mechanical histories of the form (1.3)
characterized by position and/or momentum projections at two moments of time. We consider
the case of both exact and approximate (Gaussian) projection operators. The general idea
is to use the two-time histories to derive imprecise samplings of phase space, and then
compute lower bounds on the information of the quantum-mechanical phase space
distributions. In regimes where they are non-trivial, we find that all of the bounds have the
approximate form

$$ I(K,X) \gtrsim \ln\left(\frac{2\pi\hbar}{\sigma_x \sigma_k}\right) \tag{1.9} $$

where $I(K,X)$ is the information of the phase space distributions and $\sigma_x\sigma_k$ is the volume
of phase space probed by the projections.

In Section VI we go on to study histories characterized by position samplings at $n$
moments of time. We show that the uncertainty principle arises as a restriction on the
information of the approximate form,

$$ I(X_1, \ldots, X_n) \gtrsim \ln\left(\frac{V_H}{\sigma_1 \sigma_2 \cdots \sigma_n}\right) \tag{1.10} $$

in the regime where it is non-trivial. Here, $\sigma_i$ is the width of the position sampling $i$, and
$V_H^{-1}$ is a "density of paths" factor. We argue that $V_H$ thus has the interpretation as the
"fundamental volume of history space", analogous to the factor of $2\pi\hbar$ in (1.9). We derive
a result identical in form for histories characterized by other types of projections. We thus
obtain a form of the uncertainty principle which is both concise and general, and is phrased
entirely in the language of histories, without reference to phase space. We summarize and
discuss in Section VII.

Some words are in order concerning the use of Shannon information for the probabilities 
(1.3). Since the quantities defined by (1.3) generally do not in fact satisfy the probability 
sum rules, such as (1.5), they cannot strictly be regarded as probabilities. Use of the 
Shannon information (1.7) therefore requires some qualification. Although they do not 
obey the probability sum rules, the (candidate) probabilities (1.3) are non-negative and 
normalized, and thus the information (1.7) is a well-defined quantity, and may be used as 
a measure of the degree of spread of the candidate probability. The important point is
that at no stage are the probability sum rules assumed, and thus no inconsistencies arise.

It is of course an interesting question, from the perspective of the decoherent histories
approach, to extend the considerations of the present paper to the case in which the
candidate probabilities (1.3) do obey the probability sum rules. Decoherence may be achieved,
for example, by coupling the system of interest to an environment. Modifications of the
uncertainty relations (1.9), (1.10) due to environmentally-induced (e.g. thermal)
fluctuations can then be expected. This is considered in Refs. [8,9]. The information-theoretic
inequalities considered here then become conditions that such decohering probabilities
must satisfy in the limit that the coupling to the environment goes to zero.



II. INFORMATION THEORY 

In this section, we briefly review some results from information theory. This section 
solely concerns generic probability distributions, and makes no reference to quantum me- 
chanics. 

Let $p_i$ be the probabilities for a data set $S$ consisting of a discrete set of alternatives
labeled by $i$, $i = 1, 2, \ldots, N$. One has $0 \le p_i \le 1$ and $\sum_i p_i = 1$. The information of the
data set $S$ is defined to be

$$ I(S) = -\sum_{i=1}^{N} p_i \ln p_i \tag{2.1} $$

Here, $\ln$ is the logarithm to base $e$. $I(S)$ satisfies the inequalities

$$ 0 \le I(S) \le \ln N \tag{2.2} $$

It reaches its minimum if and only if $p_i = 1$ for one particular value of $i$, and so $p_i = 0$ for
all the other values. It reaches its maximum when $p_i = 1/N$ for all $i$. The information of a
probability distribution is therefore a measure of how strongly peaked it is about a given
alternative. For this reason, $I(S)$ is sometimes referred to as uncertainty, being large for
spread out distributions and small for concentrated ones. $I(S)$ is sometimes also referred
to as the entropy of the distribution, but we shall not use that nomenclature here.

Base 2 is often used in the definition (2.1). In this case $I(S)$ has the interpretation
as the average number of bits required to specify an alternative, given that alternative $i$
occurs with probability $p_i$.
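
The definition (2.1) and the bounds (2.2) can be checked directly on a few small distributions. The following is a minimal sketch; the function name `shannon_information` and the example distributions are our own choices for illustration.

```python
import math

def shannon_information(probs, base=math.e):
    """I(S) = -sum_i p_i log p_i, with the convention 0 log 0 = 0."""
    if any(p < 0 for p in probs) or abs(sum(probs) - 1.0) > 1e-12:
        raise ValueError("probabilities must be non-negative and sum to 1")
    return -sum(p * math.log(p, base) for p in probs if p > 0)

N = 8
peaked = [1.0] + [0.0] * (N - 1)     # one certain alternative: lower bound of (2.2)
uniform = [1.0 / N] * N              # maximally spread: upper bound ln N of (2.2)
skewed = [0.5, 0.25, 0.125, 0.125]   # an intermediate case

print(shannon_information(peaked))
print(shannon_information(uniform), math.log(N))
print(shannon_information(skewed, base=2))   # close to 1.75 bits
```

The last line uses base 2, so the result is the average number of bits needed to specify an alternative of the `skewed` distribution.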
Information may also be defined for continuous probability distributions. Let $X$ be a
random variable with probability density $p(x)$. Then $\int dx\, p(x) = 1$. The information of
$X$ is defined to be

$$ I(X) = -\int dx\, p(x) \ln p(x) \tag{2.3} $$

Unlike the discrete case, $I(X)$ is no longer necessarily positive, since $p(x)$ is not a probability, but a
probability density, so may be greater than 1. However, it retains its utility as a measure
of uncertainty. This is exemplified by a Gaussian distribution of width $\Delta x$ (that is, of variance $(\Delta x)^2$),

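$$ p(x) = \frac{1}{\left(2\pi(\Delta x)^2\right)^{1/2}} \exp\left(-\frac{x^2}{2(\Delta x)^2}\right) \tag{2.4} $$

It has information

$$ I(X) = \ln\left(2\pi e(\Delta x)^2\right)^{1/2} \tag{2.5} $$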

From this we see that $I(X)$ may be unbounded from below, and indeed, approaches $-\infty$ as
$\Delta x \to 0$ and $p(x)$ approaches a delta-function. $I(X)$ is also unbounded from above, as may
be seen by taking the width $\Delta x$ to be very large. However, if the variance is fixed, then
a straightforward variational calculation shows that $I(X)$ is maximized by the Gaussian
distribution (2.4). Eq.(2.5) therefore represents an upper bound on the information of
probability distributions with variance $(\Delta x)^2$,

$$ I(X) \le \ln\left(2\pi e(\Delta x)^2\right)^{1/2} \tag{2.6} $$

with equality if and only if $p(x)$ is a Gaussian.
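
As a quick illustration of (2.6), one can compare the Gaussian value (2.5) with the information of two other densities of the same width. The sketch below uses the standard closed-form results for a flat density and for a two-sided exponential (Laplace) density; these two comparison distributions are our own choice and do not appear in the paper.

```python
import math

def gaussian_information(width):
    """Eq. (2.5): ln (2 pi e width^2)^(1/2) for a Gaussian of standard deviation `width`."""
    return 0.5 * math.log(2.0 * math.pi * math.e * width ** 2)

def uniform_information(width):
    """Flat density on an interval of length L = width * sqrt(12); its information is ln L."""
    return math.log(width * math.sqrt(12.0))

def laplace_information(width):
    """Density exp(-|x|/b) / (2b) with b = width / sqrt(2); its information is 1 + ln(2b)."""
    b = width / math.sqrt(2.0)
    return 1.0 + math.log(2.0 * b)

for width in (0.01, 0.1, 1.0, 10.0):
    print(width, gaussian_information(width),
          uniform_information(width), laplace_information(width))
```

For every width the Gaussian value is the largest of the three, and for small widths all three values are negative, in line with the remark that $I(X)$ is unbounded from below.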

The literature contains a vast number of results about information. We will record
only one, since it will be needed later. Suppose from a probability distribution $p(x)$ one
constructs a "coarser-grained" probability distribution

$$ q(\bar x) = \int dx\, f(\bar x, x)\, p(x) \tag{2.7} $$

for some smearing function $f(\bar x, x)$ satisfying $\int d\bar x\, f(\bar x, x) = 1$, so that $q(\bar x)$ is
normalized, and $\int dx\, f(\bar x, x) = 1$. Then if we denote the
information of $q(\bar x)$ by $I(\bar X)$, it may be shown that

$$ I(\bar X) \ge I(X) \tag{2.8} $$

This inequality expresses the intuitive idea that smearing or coarse-graining a probability
distribution increases the amount of uncertainty it expresses. A corresponding result also 
holds for the discrete case. The result, for both the continuous and discrete case, follows 
readily from the convexity of the function xlnx, so we shall refer to this result as the 
convexity property. 
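
The discrete version of the convexity property is easy to test numerically. The sketch below assumes a particular smearing, chosen by us for illustration: each bin is mixed with its two neighbours on a periodic lattice, which corresponds to a doubly stochastic smearing matrix (rows and columns both sum to one).

```python
import math

def information(probs):
    """Discrete Shannon information, Eq. (2.1)."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def smear(probs, weight=0.25):
    """Mix each bin with its two periodic neighbours (a doubly stochastic smearing)."""
    n = len(probs)
    return [
        (1 - 2 * weight) * probs[i]
        + weight * probs[(i - 1) % n]
        + weight * probs[(i + 1) % n]
        for i in range(n)
    ]

p = [0.7, 0.2, 0.05, 0.05, 0.0, 0.0]
q = smear(p)
q2 = smear(q)

print(information(p), information(q), information(q2))
```

Each application of `smear` leaves the distribution normalized and does not decrease its information, as (2.8) requires.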

For further details on information theory, see Refs. [10,11]. 



III. INFORMATION-THEORETIC 
UNCERTAINTY RELATIONS 

We now describe a number of information-theoretic expressions of the uncertainty 
principle. We begin by describing the projection operators used to sample position and 
momentum. 

III(A). Samplings of Position and Momentum 

Approximate samplings of position may be carried out using projection operators. The
projection operators effect a partition of the real line into regions (or "bins") of size $\sigma_x$.
Explicitly, they take the form

$$ P^x_\alpha = \int dx\, T(x - \bar x_\alpha)\, |x\rangle\langle x| \tag{3.1} $$

where $T(x - \bar x_\alpha)$ is a sampling function. The most appropriate choice is to take it to be

$$ T(x - \bar x_\alpha) = \theta\left(x - \bar x_\alpha + \tfrac{1}{2}\sigma_x\right)\, \theta\left(-x + \bar x_\alpha + \tfrac{1}{2}\sigma_x\right) \tag{3.2} $$

It is equal to 1 in an interval of size $\sigma_x$ centred around $\bar x_\alpha$ and zero otherwise, where
$\bar x_\alpha = \alpha\sigma_x$, and $\alpha$ is an integer. We will generally use a bar to denote coarse-grained
variables. The sampling function satisfies the relations,

$$ \sum_\alpha T(x - \bar x_\alpha) = 1 \tag{3.3} $$

$$ \int dx\, T(x - \bar x_\alpha) = \sigma_x \tag{3.4} $$

Eq.(3.3) ensures that the projections are exhaustive. They are exclusive because $T$ vanishes
outside an interval of width $\sigma_x$, so that distinct bins do not overlap.



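Another choice for $T$ which is sometimes convenient is a Gaussian of width $\sigma_x$,

$$ T(x - \bar x_\alpha) = \frac{1}{(2\pi)^{1/2}} \exp\left(-\frac{(x - \bar x_\alpha)^2}{2\sigma_x^2}\right) \tag{3.5} $$

Again $\bar x_\alpha = \alpha\sigma_x$, but $\alpha$ is now a continuous label. The properties (3.3) and (3.4) still hold,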
given the convention that the summation over a is now an integration. With this choice of 
T the projections are only approximately exclusive. This means that the label a, although 
continuous, really has significance only up to order 1. 
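
The relations (3.3) and (3.4) can be verified numerically for both choices of sampling function. The sketch below is illustrative: the function names, the bin width, the test point and the midpoint-rule integrator are our own choices, and the Gaussian sampler assumes the normalization $(2\pi)^{-1/2}$ that makes its $x$-integral equal to $\sigma_x$.

```python
import math

def box_sampler(x, center, width):
    """Exact sampling function: 1 inside [center - width/2, center + width/2), else 0."""
    return 1.0 if -0.5 * width <= x - center < 0.5 * width else 0.0

def gaussian_sampler(x, center, width):
    """Gaussian sampling function, normalised so that its x-integral equals `width`."""
    return math.exp(-(x - center) ** 2 / (2.0 * width ** 2)) / math.sqrt(2.0 * math.pi)

def riemann(f, lo, hi, steps=20000):
    """Midpoint-rule estimate of the integral of f over [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))

sigma_x = 0.5
x = 0.3  # arbitrary test point

# (3.3) with integer labels alpha (exact bins)
box_sum = sum(box_sampler(x, alpha * sigma_x, sigma_x) for alpha in range(-50, 51))
# (3.3) with a continuous label alpha (Gaussian bins): the sum becomes an integral
gauss_sum = riemann(lambda a: gaussian_sampler(x, a * sigma_x, sigma_x), -40.0, 40.0)
# (3.4): integral over x at fixed bin centre
box_area = riemann(lambda y: box_sampler(y, 0.0, sigma_x), -5.0, 5.0)
gauss_area = riemann(lambda y: gaussian_sampler(y, 0.0, sigma_x), -5.0, 5.0)

print(box_sum, gauss_sum, box_area, gauss_area)
```

Both samplers give a total of one when summed (or integrated) over the bin label, and an area of $\sigma_x$ when integrated over $x$, up to the accuracy of the quadrature.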

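The case of precise samplings, $P_x = |x\rangle\langle x|$, is obtained by writing $P_x = \sigma_x^{-1} P^x_\alpha$ and
letting $\sigma_x \to 0$, and one has

$$ \sigma_x^{-1}\, T(x - \bar x_\alpha) \to \delta(x - \bar x), \qquad \sigma_x \sum_\alpha \to \int d\bar x \tag{3.6} $$

In a similar manner, one may construct projections for samplings of momentum,

$$ P^k_\beta = \int dk\, T(k - \bar k_\beta)\, |k\rangle\langle k| \tag{3.7} $$

for some sampling function $T(k - \bar k_\beta)$, of width $\sigma_k$, where $\bar k_\beta = \beta\sigma_k$.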

III(B). Samplings of Two Ensembles 

The first result we shall describe envisages a situation in which one has two ensembles, 
prepared in an identical state. Samplings of position are made on the first ensemble, and 
samplings of momentum are made on the second. 
Consider a position sampling of a system described by a density matrix $\rho$. The
probability that the result lies in the region labeled by $\alpha$ is,

$$ p(\alpha) = \mathrm{Tr}\left(P^x_\alpha \rho\right) = \int dx\, T(x - \bar x_\alpha)\, \langle x|\rho|x\rangle \tag{3.8} $$

We wish to use the information $I_\rho(\bar X)$ as a measure of uncertainty in the probability
distribution $p(\alpha)$. By the convexity property (2.8), one has

$$ I_\rho(\bar X) \equiv -\sum_\alpha p(\alpha) \ln p(\alpha) \ge -\int dx\, \langle x|\rho|x\rangle \ln \langle x|\rho|x\rangle - \ln\sigma_x = I_\rho(X) - \ln\sigma_x \tag{3.9} $$

The $\ln\sigma_x$ term arises because of (3.4).

In a similar manner, we can consider a momentum sampling on an identically prepared 
system, giving the probability distribution 

$$ p(\beta) = \mathrm{Tr}\left(P^k_\beta \rho\right) \tag{3.10} $$

One may compute its information, $I_\rho(\bar K)$, and one has,

$$ I_\rho(\bar K) \ge I_\rho(K) - \ln\sigma_k \tag{3.11} $$

A general density operator $\rho$ may be written

$$ \rho = \sum_i c_i\, |\psi_i\rangle\langle\psi_i| \tag{3.12} $$

for some set of states $\{\psi_i\}$. Again using the convexity property (2.8), one has

$$ I_\rho(X) \ge \sum_i c_i\, I_{\psi_i}(X) \tag{3.13} $$

where $I_{\psi_i}(X)$ denotes the information of the probability distribution obtained from precise
sampling of the pure state $|\psi_i\rangle$. A similar result holds for momentum samplings. It follows
that there exists a pure state $|\psi\rangle$ such that

$$ I_\rho(\bar X) + I_\rho(\bar K) \ge I_\psi(X) + I_\psi(K) - \ln(\sigma_x\sigma_k) \tag{3.14} $$

with equality for precise samplings and $\rho = |\psi\rangle\langle\psi|$. (Note that the $\ln(\sigma_x\sigma_k)$ term
disappears in the limit of precise samplings, since it is taken into the projections on the left-hand
side, as described above.)

$I_\psi(K)$ and $I_\psi(X)$ are individually unbounded from below, since one can always find
states which are arbitrarily peaked in either position or in momentum. However, a state
strongly peaked in position, and hence with large negative $I_\psi(X)$, will be very spread out
in momentum, and thus $I_\psi(K)$ will be large and positive. It is therefore plausible that the
uncertainty principle will express itself as a lower bound on the sum, $I_\psi(X) + I_\psi(K)$. The
usual inequality expressing the uncertainty principle,

$$ \Delta x\, \Delta k \ge \frac{\hbar}{2} \tag{3.15} $$

achieves equality for the minimum uncertainty wave packets. Since they are Gaussians,
we immediately have, from (2.5),

$$ I_\psi(X) + I_\psi(K) = \ln(2\pi e\, \Delta x\, \Delta k) = \ln(\pi e \hbar) \tag{3.16} $$

for the minimum uncertainty wave packets (coherent states). It was therefore conjectured
by Everett [12] that the uncertainty principle may be expressed in information-theoretic
terms as

$$ I_\psi(X) + I_\psi(K) \ge \ln(\pi e \hbar) \tag{3.17} $$

He also noted that this inequality implies the usual form of the uncertainty principle. To 
see this, recall from Section II that the information of a probability distribution is bounded 
from above by the information of a Gaussian of the same variance. This means that for
any state with variances $\Delta x$, $\Delta k$, one has

$$ \ln(2\pi e\, \Delta x\, \Delta k) \ge I_\psi(X) + I_\psi(K) \tag{3.18} $$

The usual uncertainty principle then follows immediately by comparing (3.17) and (3.18).
The inequality (3.17) was proved by Beckner [13], Bialynicki-Birula and Mycielski [14] and
Hirschmann [15], using the Hausdorff-Young inequalities from Fourier analysis.

Combining all of the above results, we have

$$ I_\rho(\bar X) + I_\rho(\bar K) \ge 1 + \ln\left(\frac{\pi\hbar}{\sigma_x\sigma_k}\right) \tag{3.19} $$

with equality for precise samplings and $\rho$ a minimum uncertainty wavepacket. Eq.(3.19)
represents a very modest generalization of (3.17) to the case of imprecise samplings of 
position and momentum. 
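
The bound (3.19) can be explored numerically for a minimum uncertainty wave packet sampled with exact bins. The sketch below is an illustration under our own assumptions: units with $\hbar = 1$, zero-mean Gaussian position and momentum densities of widths $\Delta x$ and $\Delta k = \hbar/2\Delta x$, bin probabilities computed from the error function, and hypothetical helper names.

```python
import math

HBAR = 1.0  # assumed units

def binned_information(std, bin_width, n_bins=2000):
    """Information of a zero-mean Gaussian density of standard deviation `std`,
    sampled with contiguous bins of size `bin_width` centred on alpha * bin_width."""
    def cdf(x):
        return 0.5 * (1.0 + math.erf(x / (std * math.sqrt(2.0))))
    total = 0.0
    for alpha in range(-n_bins, n_bins + 1):
        p = cdf((alpha + 0.5) * bin_width) - cdf((alpha - 0.5) * bin_width)
        if p > 0.0:
            total -= p * math.log(p)
    return total

def lower_bound(sigma_x, sigma_k):
    """Right-hand side of (3.19)."""
    return 1.0 + math.log(math.pi * HBAR / (sigma_x * sigma_k))

delta_x = 0.7
delta_k = HBAR / (2.0 * delta_x)   # minimum uncertainty packet
cases = [(0.05, 0.05), (0.2, 0.1), (1.0, 1.0)]
results = []
for sigma_x, sigma_k in cases:
    total = binned_information(delta_x, sigma_x) + binned_information(delta_k, sigma_k)
    results.append((total, lower_bound(sigma_x, sigma_k)))
    print(sigma_x, sigma_k, total, lower_bound(sigma_x, sigma_k))
```

For fine bins the two columns nearly coincide, which reflects the statement that equality is approached for precise samplings of a minimum uncertainty packet; for coarse bins the sum of informations lies visibly above the bound.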



III(C). Samplings of a Single Ensemble 



Of greater interest for our purposes is the situation in which the samplings of position 
and momentum are made on the same system. There are a number of ways of doing this, 
and we shall consider them in turn. Perhaps the simplest is to carry out simultaneous but 
imprecise samplings of both position and momentum. These may be effected using the 
coherent state projectors, which we now describe. 

The (canonical) coherent states [16] may be defined to be the states 

$$ |z\rangle = |p,q\rangle = U(p,q)|0\rangle \tag{3.20} $$

where $|0\rangle$ is the ground state of the harmonic oscillator. $U(p,q)$ is the unitary Weyl
operator,

$$ U(p,q) = \exp\left(\frac{i}{\hbar}(pQ - qP)\right) \tag{3.21} $$
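where $Q$ and $P$ denote the position and momentum operators. In the position
representation the coherent states are given by

$$ \langle x|p,q\rangle = \frac{1}{(2\pi\sigma_q^2)^{1/4}} \exp\left(-\frac{(x-q)^2}{4\sigma_q^2} + \frac{i}{\hbar}px - \frac{i}{2\hbar}pq\right) \tag{3.22} $$
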
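Their most important property is the completeness relation,

$$ \int \frac{dp\, dq}{2\pi\hbar}\, |p,q\rangle\langle p,q| = 1 \tag{3.23} $$

They are however only approximately orthogonal

$$ \langle p,q|p',q'\rangle = \exp\left(\frac{i}{2\hbar}(p'q - q'p) - \frac{1}{8}\left[\frac{(p-p')^2}{\sigma_p^2} + \frac{(q-q')^2}{\sigma_q^2}\right]\right) \tag{3.24} $$

where $\sigma_p\sigma_q = \hbar/2$. These properties suggest that we may regard the operator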

$$ P_z = |p,q\rangle\langle p,q| \tag{3.25} $$

as an approximate projection operator effecting approximate simultaneous samplings of
position and momentum. The approximate orthogonality property (3.24) means that the
labels $p$ and $q$ are coarse-grained momentum and position, having significance only up to
the widths $\sigma_p$ and $\sigma_q$ respectively.

If the state of the system is described by a density operator $\rho$, the probability
distribution of approximate position $x$ and approximate momentum $k$ is therefore

$$ p(k,x) = \mathrm{Tr}(P_z \rho) = \langle k,x|\rho|k,x\rangle \tag{3.26} $$

This probability is normalized in the measure $dk\,dx/2\pi\hbar$. Consider the information of this
distribution,

$$ I_\rho(K,X) = -\int \frac{dk\,dx}{2\pi\hbar}\, p(k,x) \ln p(k,x) \tag{3.27} $$

If $p(k,x)$ were a classical phase space distribution, then (3.27) would be the usual entropy
in statistical mechanics. The entropy would be unbounded from below because in classical 
mechanics, the phase space distribution may be arbitrarily concentrated about a particular
region of phase space. In quantum mechanics, by contrast, phase space distributions
concentrated on regions smaller in size than $\hbar$ would violate the uncertainty principle. We
therefore expect a lower bound on (3.27).

A reasonable guess as to what this lower bound should be is obtained by evaluating
(3.27) with $\rho$ a coherent state, since the coherent states are normally thought of as being
the states most concentrated in phase space. Writing $\rho = |z\rangle\langle z|$, one finds

$$ I_{|z\rangle\langle z|}(K,X) = 1 \tag{3.28} $$

For these reasons, it was conjectured by Wehrl [17] that

$$ I_\rho(K,X) \ge 1 \tag{3.29} $$

with equality if and only if $\rho$ is a coherent state. This was subsequently proved by Lieb [18],
again using some inequalities from Fourier analysis (best constants in the Hausdorff-Young
and Young inequalities).

A simple but important generalization of this result was noted by Grabowski [19]. This
is that the inequality (3.29) continues to hold for projections constructed from a class of
generalized coherent states, namely, those of the form

$$ |p,q\rangle = U(p,q)|\psi\rangle \tag{3.30} $$

where $|\psi\rangle$ is an arbitrary state. The point is that they share with the usual coherent states
the completeness relation, (3.23), and it is this property that is exploited in Lieb's proof.

This generalization also permits a connection with the usual uncertainty relation to be
made. One has

$$ I_\rho(K,X) \le I_\rho(X) + I_\rho(K) \le \ln\left(\frac{e\,\Delta x\,\Delta k}{\hbar}\right) \tag{3.31} $$

where $\Delta x$ and $\Delta k$ are the variances of $x$ and $k$ in the probability distribution (3.26), but
with the generalized coherent states (3.30). The first inequality is a standard property of
information; the second is the inequality (2.6) used twice (up to a factor of $2\pi\hbar$, because
of our choice of phase space measure). Together with (3.29), (3.31) implies that

$$ \Delta x\, \Delta k \ge \hbar \tag{3.32} $$

This is not the usual uncertainty relation (no factor of $\tfrac{1}{2}$) because the variances express
not only the uncertainty in the initial state, but also the uncertainty in the projections,
which are imprecise. Indeed, one has

$$ (\Delta x)^2 = (\Delta_\rho x)^2 + (\Delta_\psi x)^2 \tag{3.33} $$

$$ (\Delta k)^2 = (\Delta_\rho k)^2 + (\Delta_\psi k)^2 \tag{3.34} $$

The first term on the right-hand side of each relation is the variance in the initial state; the
second is the variance in the generalized coherent state projection with fiducial state $|\psi\rangle$.
Choosing $\rho$ to be the pure state $|\psi\rangle\langle\psi|$, one thus obtains the usual uncertainty relation,

$$ \Delta_\psi x\, \Delta_\psi k \ge \frac{\hbar}{2} \tag{3.35} $$

An alternative method of connecting (3.29) with the usual uncertainty relations may be
found in Ref. [8].
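
The coherent-state value (3.28) can be checked by direct numerical integration. The sketch below rests on assumptions of our own: units with $\hbar = 1$, a coherent state centred at the origin, and the standard result that its phase space distribution $|\langle k,x|z\rangle|^2$ is a Gaussian with exponent $-k^2/4\sigma_p^2 - x^2/4\sigma_q^2$, where $\sigma_p\sigma_q = \hbar/2$. The function names and grid parameters are hypothetical.

```python
import math

HBAR = 1.0                        # assumed units
SIGMA_Q = 0.8                     # position width of the fiducial ground state
SIGMA_P = HBAR / (2.0 * SIGMA_Q)  # so that SIGMA_P * SIGMA_Q = HBAR / 2

def husimi_of_coherent_state(k, x):
    """|<k,x|z>|^2 for a coherent state |z> centred at the origin (assumed Gaussian form)."""
    return math.exp(-k * k / (4.0 * SIGMA_P ** 2) - x * x / (4.0 * SIGMA_Q ** 2))

def phase_space_sums(density, half_width=12.0, steps=400):
    """Midpoint-rule estimates of the normalisation and of the information (3.27),
    both taken in the measure dk dx / (2 pi hbar)."""
    hx = 2.0 * half_width * SIGMA_Q / steps
    hk = 2.0 * half_width * SIGMA_P / steps
    cell = hx * hk / (2.0 * math.pi * HBAR)
    norm = 0.0
    info = 0.0
    for i in range(steps):
        x = -half_width * SIGMA_Q + (i + 0.5) * hx
        for j in range(steps):
            k = -half_width * SIGMA_P + (j + 0.5) * hk
            p = density(k, x)
            norm += p * cell
            if p > 0.0:
                info -= p * math.log(p) * cell
    return norm, info

norm, info = phase_space_sums(husimi_of_coherent_state)
print("normalisation:", norm, "information:", info)
```

Both numbers come out very close to one: the distribution is normalized in the measure $dk\,dx/2\pi\hbar$, and its information saturates the bound (3.29).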

Results similar to (3.17) and (3.19) have been obtained by Deutsch [20], Partovi [21],
and Maassen and Uffink [22]. Eq.(3.17) has been generalized to include thermal fluctuations
at thermal equilibrium by Abe and Suzuki [23]. Anderson and Halliwell have generalized
(3.29) to include thermal fluctuations in a class of non-equilibrium systems [8] (see also
Ref. [9]). For an alternative approach to unsharp samplings of non-commuting observables
using positive operator-valued measures, see Schroeck [24]. For other related results on 
information-theoretic uncertainty relations, not directly relevant to the present paper, see 
Refs.[25,26,27,28,29,30,31,32]. 
IV. TWO-TIME HISTORIES -
APPROXIMATE PROJECTORS



We now show how to obtain information-theoretic uncertainty relations for histories 
characterized by projections at two moments of time. The projections will be onto position 
at two moments of time, or onto momentum and position. The important feature is that 
the time-dependent projections $P_\alpha(t_k)$ do not commute, and so one would not expect
their probability distributions to be arbitrarily peaked. We therefore expect to derive 
lower bounds on the information, in analogy with (3.29). 

The case of position and momentum samplings by exact projections, such as Eq.(3.2), 
is quite different from the case of approximate projections, such as Eq.(3.5), and each case 
needs to be treated separately. The approximate projection case is a direct extension of 
the results of Section III(C), and we consider this case first. The case of exact projections 
will be treated in the next section. 

IV(A). A Lower Bound on the Information 

In brief, the idea is as follows. The probabilities for histories are most generally given 
by an expression of the form 

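$$ p(\underline{\alpha}) = \mathrm{Tr}\left(C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}}\, \rho\right) \tag{4.1} $$

where $C_{\underline{\alpha}}$ denotes a string of time-dependent projection operators,

$$ C_{\underline{\alpha}} = P^n_{\alpha_n}(t_n) \cdots P^1_{\alpha_1}(t_1) $$

and we use the notation $\underline{\alpha}$ to denote a string of $\alpha$'s. The burden of the results described
below will be to show that for the case of two-time histories considered here, the operator
$C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}}$ may be written in the form,

$$ C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}} = U(\bar k, \bar x)\, \Omega\, U^\dagger(\bar k, \bar x) \tag{4.2} $$

for some operator $\Omega$. The point is that the dependence on the sampled positions and
momenta $\bar x$ and $\bar k$ resides entirely in the unitary Weyl operator $U(\bar k, \bar x)$. Now $C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}}$ is a
positive hermitian operator, and thus $\Omega$ is also. It may therefore be written

$$ \Omega = \sum_a \lambda_a\, |a\rangle\langle a| \tag{4.3} $$

where the coefficients $\lambda_a$ are positive. The probabilities (4.1) for our two-time histories
may now be written

$$ p(\alpha,\beta) = \sum_a \lambda_a\, \langle a|\, U^\dagger(\bar k,\bar x)\, \rho\, U(\bar k,\bar x)\, |a\rangle \tag{4.4} $$

Here, we have introduced, as earlier, the continuous bin labels $\alpha$ and $\beta$, defined in terms
of the sampled positions and momenta by $\bar x = \alpha\sigma_x$, $\bar k = \beta\sigma_k$, where $\sigma_x$ and $\sigma_k$ are their
respective widths. The right-hand side of (4.4) involves the expectation value of $\rho$ in the
generalized coherent states, $|a_{\bar k \bar x}\rangle = U(\bar k,\bar x)|a\rangle$. By the convexity property (2.8), the
information of (4.4) satisfies the inequality,

$$ I(K,X) = -\int d\alpha\, d\beta\; p(\alpha,\beta) \ln p(\alpha,\beta) \ge -\int d\bar x\, d\bar k\; \langle a_{\bar k\bar x}|\rho|a_{\bar k\bar x}\rangle \ln \langle a_{\bar k\bar x}|\rho|a_{\bar k\bar x}\rangle - \ln(\sigma_x\sigma_k) \tag{4.5} $$

The factor of $\ln(\sigma_x\sigma_k)$ arises from the change of variables from $\alpha$, $\beta$ to $\bar x$, $\bar k$. From the
previous section, Eqs.(3.29), (3.30), we thus deduce the inequality,

$$ I(K,X) \ge 1 + \ln\left(\frac{2\pi\hbar}{\sigma_x\sigma_k}\right) \tag{4.6} $$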

The factor of $2\pi\hbar$ appears because of the difference in phase space measures used in (3.27)
and (4.5). The factor of 2 difference between (4.6) and (3.19) is due to the fact that at
equality, (3.19) measures the uncertainty in the state alone, whereas (4.6) also includes the
uncertainty in the coherent state projector.

Eq.(4.6) is an intuitively appealing result. The argument of the logarithm is the inverse
of the number of elementary cells of phase space sampled. If that number is large, i.e.,
$\sigma_x\sigma_k \gg 2\pi\hbar$, then the lower bound approaches $-\infty$, and thus the uncertainty principle
imposes little restriction on samplings of phase space large compared to the fundamental
cell. On the other hand, the bound becomes significant when $\sigma_x\sigma_k$ is of order $2\pi\hbar$ or
smaller, in agreement with the expectation that the uncertainty principle imposes
limitations on samplings comparable to the size of the fundamental cell.
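
To make the dependence on the number of sampled cells concrete, the right-hand side of (4.6) can be tabulated. The sketch below assumes units with $\hbar = 1$; the function name `two_time_bound` and the chosen cell counts are ours.

```python
import math

HBAR = 1.0  # assumed units

def two_time_bound(sigma_x, sigma_k):
    """Right-hand side of (4.6): 1 + ln(2 pi hbar / (sigma_x sigma_k))."""
    return 1.0 + math.log(2.0 * math.pi * HBAR / (sigma_x * sigma_k))

sigma_x = 1.0
for cells in (0.1, 1.0, math.e, 100.0, 1e6):
    # choose sigma_k so that sigma_x * sigma_k equals `cells` elementary cells
    sigma_k = cells * 2.0 * math.pi * HBAR / sigma_x
    print(cells, two_time_bound(sigma_x, sigma_k))
```

Written in terms of the number of cells $\sigma_x\sigma_k/2\pi\hbar$, the bound is one minus the logarithm of that number, so it changes sign when the sampled volume is $e$ elementary cells and decreases without limit for coarser samplings.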

Everything up to Eq.(4.5) is also true for the discrete case (with the integral over $\alpha$,
$\beta$ replaced by a discrete sum), but it is not possible to deduce the inequality (4.6), since
this holds only for the continuous case.

We now need to show that the projections satisfy the condition (4.2) for the two-time
histories of interest. It is also necessary to calculate $\Omega$, to determine the conditions under
which the inequality becomes equality. Before that, we need to describe some mathematical
tools.

IV(B). The Weyl Calculus 

The analysis of (4.1) is conveniently carried out with the aid of a set of mathematical 
tools referred to as the Weyl calculus. This in turn is part of a larger area of mathematics 
called microlocal analysis [33]. The basic idea is to define a one-to-one correspondence 
between every self-adjoint operator, $A$ say, on the Hilbert space, and a real function $A(p,q)$
defined in a phase space, referred to as the Weyl symbol of $A$. A particular example of
how this correspondence may be obtained is through the Wigner transform,

$$ W_A(p,q) = \int d\xi\; e^{-\frac{i}{\hbar}p\xi}\, \left\langle q + \tfrac{1}{2}\xi \right| A \left| q - \tfrac{1}{2}\xi \right\rangle \tag{4.7} $$

When the operator $A$ is the density operator, $\rho$, $W_\rho$ is called the Wigner function. It shares
many properties of classical phase space distributions, although it is often not positive. It
has been used extensively in discussions of the classical limit [1,34,35,36]. We shall make
use of the Wigner transform (4.7) to analyse (4.1).
An alternative form of (4.7) that we shall find more useful is,

$$ W_A(p,q) = \mathrm{Tr}\left(\Delta(p,q)\, A\right) \tag{4.8} $$

where

$$ \Delta(p,q) = \frac{\hbar}{2\pi} \int du\, dv\; e^{iu(p - P) + iv(q - Q)} \tag{4.9} $$

Here, $Q$ and $P$ are the usual position and momentum operators, satisfying $[Q,P] = i\hbar$.
We record and prove some useful properties of the Wigner transform. First, one has

$$ \mathrm{Tr}(AB) = \frac{1}{2\pi\hbar} \int dp\, dq\; W_A(p,q)\, W_B(p,q) \tag{4.10} $$

This follows readily from inserting the explicit form (4.7) into the right-hand side of (4.10).

Next, we discuss the properties of the Weyl symbol under shifts of its arguments. 
Introduce the unitary Weyl operator, 

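$$ U(p,q) = e^{\frac{i}{\hbar}pQ - \frac{i}{\hbar}qP} \tag{4.11} $$

It has the properties,

$$ U^\dagger(p,q)\, Q\, U(p,q) = Q + q \tag{4.12} $$

$$ U^\dagger(p,q)\, P\, U(p,q) = P + p \tag{4.13} $$

The Baker-Campbell-Hausdorff relation is

$$ e^{A+B} = e^A e^B e^{-\frac{1}{2}[A,B]} \tag{4.14} $$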

if [A, B] commutes with A and B. It follows that, 

and thus 

$$ U(\bar p,\bar q)\, \Delta(p,q)\, U^\dagger(\bar p,\bar q) = \Delta(p + \bar p, q + \bar q) \tag{4.16} $$
From this we see that

$$ W_A(p + \bar p, q + \bar q) = \mathrm{Tr}\left(\Delta(p+\bar p, q+\bar q)\, A\right) = \mathrm{Tr}\left(\Delta(p,q)\, A'\right) = W_{A'}(p,q) \tag{4.17} $$

where $A' = U^\dagger(\bar p,\bar q)\, A\, U(\bar p,\bar q)$. That is, translating the coordinates and momenta of the
Weyl symbol is equivalent to a unitary transformation under the Weyl operator of the
original operator.

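From (4.1) and (4.10), it follows that the probabilities for histories characterized by
the chain operator $C_{\underline{\alpha}}$ are given by

$$ p(\underline{\alpha}) = \frac{1}{2\pi\hbar}\int dp\,dq\; W_{C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}}}(p,q)\, W_\rho(p,q) \tag{4.18} $$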



IV(C). Position and Direct Momentum Samplings 



The first type of history we shall consider is one characterized by an imprecise position 
sampling at time zero and an imprecise momentum sampling at time t. The probability 
for this history is given by 



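$$ p(\alpha,\beta,t) = \mathrm{Tr}\left[ P^k_\beta\, e^{-\frac{i}{\hbar}Ht}\, P^x_\alpha\, \rho\, P^x_\alpha\, e^{\frac{i}{\hbar}Ht} \right] \tag{4.19} $$

In the short time limit, employed here, evolution is described by the free Hamiltonian.
This clearly commutes with the momentum projections, and thus $t$ drops out in the short
time limit. One thus has

$$ C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}} = P^x_\alpha P^k_\beta P^x_\alpha \tag{4.20} $$

The Weyl symbol of this operator is

$$ W_{C^\dagger C}(p,q) = \int d\xi\; e^{-\frac{i}{\hbar}p\xi}\, \left\langle q+\tfrac{1}{2}\xi\right| P^x_\alpha P^k_\beta P^x_\alpha \left| q - \tfrac{1}{2}\xi\right\rangle \tag{4.21} $$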
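Inserting the explicit forms for the projection operators, one obtains,

$$ W_{C^\dagger C}(p,q) = \frac{1}{2\pi\hbar}\int d\xi\, dk\; e^{-\frac{i}{\hbar}(p-k)\xi}\; T(q+\tfrac{1}{2}\xi - \bar x)\, T(q - \tfrac{1}{2}\xi - \bar x)\, T(k - \bar k) \tag{4.22} $$

Letting $k \to k + \bar k$, it is readily seen that one has,

$$ W_{C^\dagger C}(p,q) = W_\Omega(p - \bar k, q - \bar x) \tag{4.23} $$

Here, $\Omega$ is the operator whose Weyl symbol is (4.22), but with $\bar k = 0$ and $\bar x = 0$. It is
therefore equal to $P^x_\alpha P^k_\beta P^x_\alpha$ at $\bar k = 0$ and $\bar x = 0$, that is,

$$ \Omega = \frac{1}{2\pi\hbar}\int dx\,dy\,dk\; T(x)\,T(y)\,T(k)\; e^{\frac{i}{\hbar}k(x-y)}\, |x\rangle\langle y| \tag{4.24} $$

From (4.17), (4.23), we therefore have a result of the general form (4.2),

$$ P^x_\alpha P^k_\beta P^x_\alpha = U(\bar k,\bar x)\, \Omega\, U^\dagger(\bar k,\bar x) \tag{4.25} $$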

From the above, it therefore follows that the information of the phase space distribution 
(4.19) obeys the inequality (4.6). 

Consider now the conditions for equality. Equality is obtained if and only if both $\rho$
and $\Omega$ are of the form $|z\rangle\langle z|$, where $|z\rangle$ is a canonical coherent state, (3.20). From (4.24),
one can see that $\Omega$ will be of that form if and only if $T(k) = \delta(k)$ and $T(x)$ is a Gaussian.
That is, the first projection is a Gaussian projection onto position, and the second is an
infinitely precise sampling of momentum.

IV(D). Position and Time-of-Flight Momentum Samplings 

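We now consider a history characterized by imprecise position samplings at times $0$
and $t$. The probability is

$$ p(\alpha_1,\alpha_2,t) = \mathrm{Tr}\left[ P^x_{\alpha_2}\, e^{-\frac{i}{\hbar}Ht}\, P^x_{\alpha_1}\, \rho\, P^x_{\alpha_1}\, e^{\frac{i}{\hbar}Ht} \right] \tag{4.26} $$

From this one can construct a phase space probability $p(\alpha_1,\beta,t)$, where $\beta\sigma_k = \bar k$, and
$\bar k = m(\bar x_2 - \bar x_1)/t$, for small $t$. We have $\bar x_1 = \sigma_x\alpha_1$, $\bar x_2 = \sigma_x\alpha_2$, thus $\beta = \alpha_2 - \alpha_1$ and
$\sigma_k = m\sigma_x/t$. One has,

$$ C^\dagger_{\underline{\alpha}} C_{\underline{\alpha}} = P^x_{\alpha_1}\, e^{\frac{i}{\hbar}Ht}\, P^x_{\alpha_2}\, e^{-\frac{i}{\hbar}Ht}\, P^x_{\alpha_1} \tag{4.27} $$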

We will analyse this case for small times t. It is readily shown that the Weyl symbol is 

W^^^{p, q) =^ j d^dx2 e-iP^ Tiq +^^- xi) Tiq - - xi) T(x2 - X2) 

X {X2, t\q - 0) {X2, t\q + 0)* (4.28) 

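Now in the short time limit, the propagator is given by 

$$ \langle x'',t|x',0\rangle = \left(\frac{m}{2\pi i\hbar t}\right)^{1/2} \exp\left( \frac{i m (x''-x')^2}{2\hbar t} \right) \tag{4.29} $$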

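Inserting this in (4.28), and performing the shift $x_2 \to x_2 + \bar{x}_2$, one finds that the answer 
may be written in the form 

$$ W_{C^\dagger C}(p,q) = \frac{m}{2\pi\hbar t}\int d\xi\, dx_2 \; \Upsilon(q+\tfrac12\xi-\bar x_1)\,\Upsilon(q-\tfrac12\xi-\bar x_1)\,\Upsilon(x_2) \; \exp\left( -\frac{i}{\hbar}\,\xi\left[ p - \frac{m}{t}(\bar x_2-\bar x_1) \right] + \frac{i m}{\hbar t}\,(x_2 - q + \bar x_1)\,\xi \right) \tag{4.30} $$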
One therefore has 

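$$ W_{C^\dagger C}(p,q) = W_\Omega(p-\bar k,\, q-\bar x_1) \tag{4.31} $$

$\Omega$ is the operator whose Wigner transform is (4.30) with $\bar x_1 = 0$ and $\bar x_2 = 0$. Explicitly, 

$$ \Omega = \int dx\, dy\, dx_2 \; \langle x_2,t|y,0\rangle \, \langle x_2,t|x,0\rangle^* \; \Upsilon(x)\,\Upsilon(y)\,\Upsilon(x_2) \; |x\rangle\langle y| \tag{4.32} $$

From (4.17), (4.31), we now have the result, 

$$ P^x_{\alpha_1} \, e^{\frac{i}{\hbar}Ht} \, P^x_{\alpha_2} \, e^{-\frac{i}{\hbar}Ht} \, P^x_{\alpha_1} = U^\dagger(\bar k,\bar x_1) \, \Omega \, U(\bar k,\bar x_1) \tag{4.33} $$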

We therefore again have the inequality (4.6), for the information of the phase space distri- 
bution constructed from (4.26). 
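
The cancellation that makes the short-time Weyl symbol depend only on $p-\bar k$ and $q-\bar x_1$ can be checked numerically. The sketch below assumes the free short-time propagator $\left(m/2\pi i\hbar t\right)^{1/2}\exp\left(im(x''-x')^2/2\hbar t\right)$; the function names and the default units $m=\hbar=1$ are illustrative choices, not taken from the text. It verifies that the propagator pair appearing in (4.28) reduces to a pure phase linear in $\xi$, with the terms quadratic in the positions cancelling.

```python
import cmath
import math

def free_propagator(x2, x1, t, m=1.0, hbar=1.0):
    """Short-time (free) propagator <x2,t|x1,0> (assumed form, see lead-in)."""
    prefactor = cmath.sqrt(m / (2j * math.pi * hbar * t))
    return prefactor * cmath.exp(1j * m * (x2 - x1) ** 2 / (2 * hbar * t))

def propagator_pair(x2, q, xi, t, m=1.0, hbar=1.0):
    """<x2,t|q-xi/2,0> <x2,t|q+xi/2,0>^*, the combination entering (4.28)."""
    return (free_propagator(x2, q - xi / 2, t, m, hbar)
            * free_propagator(x2, q + xi / 2, t, m, hbar).conjugate())

def closed_form(x2, q, xi, t, m=1.0, hbar=1.0):
    """(m / 2 pi hbar t) exp(i m xi (x2 - q) / (hbar t)): the quadratic terms cancel."""
    return m / (2 * math.pi * hbar * t) * cmath.exp(1j * m * xi * (x2 - q) / (hbar * t))
```

Because the surviving phase is linear in $x_2$, the shift $x_2 \to x_2 + \bar x_2$ simply moves $\bar x_2$ into the combination $p - m(\bar x_2-\bar x_1)/t$, as in (4.30).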




Consider the conditions for equality. Again this is achieved when both $\rho$ and $\Omega$ are of 
the form $|z\rangle\langle z|$. This means that Eq.(4.30) must be the Wigner transform of a coherent 
state, i.e., a product of Gaussians in $p$ and $q$. This can be achieved by letting the width 
of the sampling function at $t$ go to a delta-function, setting the sampling function at $t = 0$ 
to a Gaussian, and then letting $t \to \infty$. (This may be seen explicitly in Ref.[36]). We are, 
however, working in the short time approximation, so this procedure can be carried out 
only for the free particle case, for which the short time approximation is exact. 

It is also possible to deduce a lower bound on the information of the joint probability 
for position samplings, (4.26). The information of $p(\alpha_1,\alpha_2,t)$ is in fact equal to that of 
$p(\alpha_1,\beta,t)$, because the Jacobian of the transformation between these variables is unity. 
One thus has the following bound on the information of (4.26): 

$$ I(X_1,X_2) \ge 1 + \ln\left( \frac{2\pi\hbar t}{m\sigma_x^2} \right) \tag{4.34} $$

This is strictly speaking a trivial rewriting of (4.6), using $\sigma_k = m\sigma_x/t$. We record the result 
because it will be generalized to an arbitrary number of position samplings in Section VI. 

IV(E). Position Samplings at Arbitrary Time Separations 

The previous case concerned position samplings for any Hamiltonian, but in the limit 
of small time separations. For the case of linear systems, we may extend this analysis to 
arbitrary time separations. We now outline how this is done. 

The propagator for linear systems is given by 

$$ \langle x'',t''|x',t'\rangle = A(t'',t') \, \exp\left( \frac{i}{\hbar} S(x'',t''|x',t') \right) \tag{4.35} $$

where $S$ is the action of the classical solution connecting initial and final points, and is 
quadratic in the $x$'s. The prefactor $A$ is independent of the $x$'s, and is given by 

$$ A(t'',t') = \left[ -\frac{1}{2\pi i\hbar} \, \frac{\partial^2 S(x'',t''|x',t')}{\partial x''\,\partial x'} \right]^{1/2} \tag{4.36} $$

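Repeating the analysis of the previous subsection, Eq.(4.28) thus has the form 

$$ W_{C^\dagger C}(p,q) = \int d\xi\, dx_2 \; \Upsilon(q+\tfrac12\xi-\bar x_1)\,\Upsilon(q-\tfrac12\xi-\bar x_1)\,\Upsilon(x_2-\bar x_2) \; |A|^2 \; \exp\left( -\frac{i}{\hbar}p\xi + \frac{i}{\hbar}S(x_2,t|q-\tfrac12\xi,0) - \frac{i}{\hbar}S(x_2,t|q+\tfrac12\xi,0) \right) \tag{4.37} $$

Now, letting $x_2 \to x_2 + \bar x_2$, and using the fact that $S$ is quadratic, (4.37) may be written, 

$$ W_{C^\dagger C}(p,q) = \int d\xi\, dx_2 \; \Upsilon(q+\tfrac12\xi-\bar x_1)\,\Upsilon(q-\tfrac12\xi-\bar x_1)\,\Upsilon(x_2) \; |A|^2 \; \exp\left( -\frac{i}{\hbar}\,\xi\,(p-\bar k) - \frac{i}{\hbar}\,\xi\,\frac{\partial S}{\partial x_1}(x_2,t|q-\bar x_1,0) \right) \tag{4.38} $$

where we have introduced 

$$ \bar k = -\frac{\partial S}{\partial \bar x_1}(\bar x_2,t|\bar x_1,0) \tag{4.39} $$

From Hamilton-Jacobi theory, $\bar k$ is the initial momentum for the classical path between $\bar x_1$ 
and $\bar x_2$. Now the point is that (4.38) depends on $\bar x_2$ and $\bar x_1$ only through the combinations 
$p - \bar k$ and $q - \bar x_1$, and we again have a result of the form (4.33), but this time with $\bar k$ 
given by (4.39). We therefore again deduce the inequality (4.6), for the information of the 
corresponding phase space distribution. 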

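What is perhaps more interesting in this case is to derive the generalization of (4.34). 
Since $\bar k$ is linear in $\bar x_2$, $\bar x_1$ in (4.39), we have, 

$$ \bar k = \frac{\partial \bar k}{\partial \bar x_2}\,\bar x_2 + \frac{\partial \bar k}{\partial \bar x_1}\,\bar x_1 \tag{4.40} $$

and thus, 

$$ \beta = \frac{\sigma_x}{\sigma_k}\left( \frac{\partial \bar k}{\partial \bar x_2}\,\alpha_2 + \frac{\partial \bar k}{\partial \bar x_1}\,\alpha_1 \right) \tag{4.41} $$

Here, as before, $\beta\sigma_k = \bar k$, $\alpha_1\sigma_x = \bar x_1$ and $\alpha_2\sigma_x = \bar x_2$. Unlike the case of short times, the 
transformation from $\alpha_1, \beta$ to $\alpha_1, \alpha_2$ has non-trivial Jacobian. It follows that 

$$ I(X_1,X_2) = I(K,X) - \ln\left( \frac{\sigma_x}{\sigma_k}\left| \frac{\partial \bar k}{\partial \bar x_2} \right| \right) \tag{4.42} $$

Finally, using the bound (4.6) on $I(K,X)$, and noting that 

$$ \frac{\partial \bar k}{\partial \bar x_2} = -\frac{\partial^2 S}{\partial \bar x_1\,\partial \bar x_2}(\bar x_2,t|\bar x_1,0) \tag{4.43} $$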



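we derive the following bound on the information of position samplings at arbitrary time 
separations, 

$$ I(X_1,X_2) \ge 1 + \ln\left( \frac{2\pi\hbar}{\sigma_x^2} \left| \frac{\partial^2 S}{\partial \bar x_1\,\partial \bar x_2} \right|^{-1} \right) \tag{4.44} $$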



We will generalize this result, and discuss it further in Section VI. 
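
As an illustration of the arbitrary-time bound, the sketch below evaluates it for a harmonic oscillator, whose classical action $S = \frac{m\omega}{2\sin\omega t}\left[(x_1^2+x_2^2)\cos\omega t - 2x_1x_2\right]$ has the mixed derivative $-m\omega/\sin\omega t$. It assumes that (4.6) reads $I(K,X) \ge 1 + \ln(2\pi\hbar/\sigma_x\sigma_k)$, so that the bound takes the form $1 + \ln\left(2\pi\hbar/(\sigma_x^2|\partial^2S/\partial \bar x_1\partial \bar x_2|)\right)$; that reading, the function names, and the units $\hbar = 1$ are assumptions made for the example.

```python
import math

def mixed_action_derivative_sho(m, omega, t):
    """d^2 S / dx1 dx2 for the harmonic oscillator classical action.

    S = m*omega / (2 sin(omega t)) * ((x1^2 + x2^2) cos(omega t) - 2 x1 x2),
    so the mixed derivative is -m*omega / sin(omega t), independent of x1, x2.
    """
    return -m * omega / math.sin(omega * t)

def two_time_bound(d2S, sigma_x, hbar=1.0):
    """Assumed form of the bound: 1 + ln(2 pi hbar / (sigma_x^2 |d2S|))."""
    return 1.0 + math.log(2 * math.pi * hbar / (sigma_x ** 2 * abs(d2S)))

def free_bound(m, t, sigma_x, hbar=1.0):
    """Free-particle special case, where d^2 S / dx1 dx2 = -m / t."""
    return two_time_bound(-m / t, sigma_x, hbar)
```

For $\omega t \to 0$ the oscillator bound reduces to the free-particle one, as expected from the short-time analysis. Near $\omega t = \pi$ the mixed derivative diverges and the bound becomes arbitrarily negative, reflecting the refocusing of classical paths.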



V. TWO-TIME HISTORIES - EXACT PROJECTORS 

As stated in the previous section, the case of exact projections is rather different to 
the case of approximate ones and needs to be treated separately. In this section we show 
how this is done. 

We are again interested in an expression for the probability of a two-time history of 
the form (4.1), where $C^\dagger_\alpha C_\alpha$ is of the form (4.20) or (4.27). In each case it is again possible 
to show that $C^\dagger_\alpha C_\alpha$ may be written in the form (4.2), although note that now $\bar x_\alpha$, $\bar k_\beta$ are 
discrete rather than continuous variables. We can go on to use the steps (4.3) to (4.5), 
except that the integral in (4.5) becomes a discrete sum, and it is at this point that we 
can go no further. Of course, if the bin sizes $\sigma_x$, $\sigma_k$ are very small, then the discrete sum 
may be approximated by the continuous integral (4.5), and we deduce the inequality (4.6). 
But more generally a different method is needed. 

Very generally, probabilities for histories are given by an expression of the form (4.1). 
If the projections contained in the chain operators $C_\alpha$ are exact projections, and either 
fine-grained projections onto discrete variables (e.g., spins), or coarse-grained projections 
onto continuous variables (e.g., as in Eq.(3.2)), then the variables $\alpha$ labeling the alternatives 
form a discrete set, and so there is a discrete (although possibly infinite) set of 
probabilities $p(\alpha)$. This means that they possess an upper bound, $p(\alpha) \le p_{\max} \le 1$, and 
a lower bound on the information follows trivially: 

$$ I = -\sum_\alpha p(\alpha) \ln p(\alpha) \ge -\ln p_{\max} \tag{5.1} $$

(Note that this is not true of the information of continuous variables. There, the $p(\alpha)$'s are 
not probabilities, but probability densities, and so need not be bounded from above.) The 
upper bound $p_{\max}$ may be computed by studying the spectrum of the operator $C^\dagger_\alpha C_\alpha$. In 
particular, the bound (5.1) will be non-trivial, i.e., $p_{\max} < 1$, if at least one pair of the 
time-dependent projections $P_{\alpha_k}(t_k)$ do not commute [37]. 

Now consider the case of two-time histories. As stated, everything in Section IV from 
(4.1) to (4.5) also holds in the case of exact projections. Suppose we obtain the spectrum 
of the operator $\Omega$, Eq.(4.3), and we look for the largest eigenvalue, $\lambda_{\max}$; thus $\Omega \le \lambda_{\max}$. 
It follows from (4.4) that 

$$ p(\alpha,\beta) \le \lambda_{\max} \tag{5.2} $$

and thus $p_{\max} = \lambda_{\max}$. Position and momentum projections at the same time, or position 
projections at different times do not commute, thus the bound (5.1) will be non-trivial. 
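
The trivial bound for discrete distributions is easy to check directly: since $\ln p(\alpha) \le \ln p_{\max}$ for every term, $-\sum_\alpha p \ln p \ge -\ln p_{\max}$. The following is a minimal sketch; the function names and the example probabilities are hypothetical and serve only to illustrate the inequality.

```python
import math

def shannon_information(probs):
    """I = -sum p ln p for a discrete distribution (terms with p = 0 contribute 0)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def trivial_lower_bound(probs):
    """-ln(p_max): each term satisfies ln p <= ln p_max, so I >= -ln p_max."""
    return -math.log(max(probs))

# Hypothetical probabilities for four alternative histories (they sum to 1).
example = [0.5, 0.25, 0.125, 0.125]
```

The bound is saturated by a uniform distribution, and it is vacuous (equal to zero) whenever some history can have probability one, which is why non-commuting projections are needed for a non-trivial statement.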



V(A). Position and Direct Momentum Samplings 



Consider first the case of a position followed by momentum sampling, so we have (4.20), 
but with exact projections. We again deduce (4.25), so we are interested in the spectrum 
of the operator $\Omega$, given by (4.24). Now write 

$$ \Omega |u\rangle = \lambda |u\rangle \tag{5.3} $$

Inserting the explicit form of $\Omega$, and performing the $k$ integration, one obtains the eigenvalue 
equation 

$$ \theta(\xi)\,\theta(1-\xi) \int_0^1 d\xi' \; \frac{\sin\big(\pi U(\xi-\xi')\big)}{\pi(\xi-\xi')} \; w(\xi') = \lambda\, w(\xi) \tag{5.4} $$

where $U = \sigma_k\sigma_x/2\pi\hbar$, $x = \sigma_x(\xi - \tfrac12)$ and $w(\xi) = \langle x|u\rangle$. Apart from $\theta$-functions on the 
left-hand side, this equation is identical to an eigenvalue equation written down by Partovi 
in his study of the analogous question for the case of samplings of two ensembles, as in 
Section III(B) [21]. It is not clear whether it can be solved exactly, but it is straightforward 
to extract the relevant information in regimes of interest. For $U \ll 1$, the kernel on the 
left-hand side is approximately equal to $U$. The spectrum is degenerate with $\lambda \approx U$, and 
$w(\xi)$ a constant on the interval $[0,1]$ and zero elsewhere. For $U \gg 1$, the kernel becomes 
a delta-function, $\delta(\xi - \xi')$. The eigenvalue equation is then satisfied by any function 
with support only in the interval $[0,1]$ (up to normalization), and the spectrum is again 
degenerate with $\lambda \approx 1$. The following bound on the information is thus obtained: 

$$ I(K,X) \ge \begin{cases} 0, & \text{if } \sigma_x\sigma_k \gg 2\pi\hbar; \\ \ln\left( \dfrac{2\pi\hbar}{\sigma_x\sigma_k} \right), & \text{if } \sigma_x\sigma_k \ll 2\pi\hbar. \end{cases} \tag{5.5} $$

Like the continuous case, (4.6), the result is intuitively appealing. The lower bound 
is non-trivial for probes of phase space comparable to or smaller than the fundamental 
cell. On the other hand, there is no restriction when the probe is much larger than 
the fundamental cell, and the lower bound is essentially zero. (It is not $-\infty$, as in the 
continuous case, because information is non-negative for discrete distributions). 

Note that the bounds (5.5) and (4.6) approximately coincide for the case $\sigma_k\sigma_x \ll 2\pi\hbar$. 
This is to be expected since, as stated above, this is the condition that the discrete and 
continuous versions of (4.5) coincide. 
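
The two asymptotic regimes of the eigenvalue problem can be explored numerically. The sketch below assumes the kernel $\sin(\pi U(\xi-\xi'))/\pi(\xi-\xi')$ on $[0,1]$, discretizes it with a midpoint rule, and estimates the largest eigenvalue by power iteration; the grid size, iteration count and function names are illustrative choices, and the result is an approximation to $\lambda_{\max}$, not an exact solution.

```python
import math

def sinc_kernel_matrix(U, n):
    """Midpoint-rule discretization of sin(pi U (xi - xi')) / (pi (xi - xi')) on [0, 1]."""
    h = 1.0 / n
    pts = [(i + 0.5) * h for i in range(n)]
    K = []
    for a in pts:
        row = []
        for b in pts:
            d = a - b
            # The kernel tends to U as xi -> xi'.
            val = U if d == 0.0 else math.sin(math.pi * U * d) / (math.pi * d)
            row.append(val * h)  # include the quadrature weight
        K.append(row)
    return K

def largest_eigenvalue(K, iterations=80):
    """Rayleigh-quotient estimate of the largest eigenvalue by power iteration."""
    n = len(K)
    v = [1.0 / math.sqrt(n)] * n  # normalized constant start vector
    lam = 0.0
    for _ in range(iterations):
        w = [sum(K[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(w[i] * v[i] for i in range(n))  # Rayleigh quotient of the current v
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return lam

def information_bound(U, n=100):
    """-ln(lambda_max): the discrete bound applied to the discretized operator."""
    return -math.log(largest_eigenvalue(sinc_kernel_matrix(U, n)))
```

For small $U$ the estimate approaches $U$, giving a bound close to $\ln(1/U) = \ln(2\pi\hbar/\sigma_x\sigma_k)$; for large $U$ it approaches 1 and the bound goes to zero. The intermediate regime $U \sim 1$, which the asymptotic argument does not cover, can be probed the same way.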

V(B). Position and Time-of-Flight Momentum Samplings 

In the case of time-of-flight momentum samplings, we study (4.26) with exact position 
projections. We again have (4.33) and we thus need to find the largest eigenvalue of the 
operator $\Omega$, in this case given by (4.32). It is straightforward to show that the eigenvalue 
equation is, 

$$ \theta(\xi)\,\theta(1-\xi) \int_0^1 d\xi' \; \exp\big( -i\pi U(\xi-\xi')(\xi+\xi'+1) \big) \; \frac{\sin\big(\pi U(\xi-\xi')\big)}{\pi(\xi-\xi')} \; w(\xi') = \lambda\, w(\xi) \tag{5.6} $$

where the various quantities are all the same as in (5.4), recalling that $\sigma_k = m\sigma_x/t$, as 
in Section IV(C). It is not difficult to see that the presence of the exponential factor in 
(5.6) in comparison to (5.4) actually makes no difference to the leading order asymptotic 
solutions in the regions $U \gg 1$ and $U \ll 1$. We thus once again obtain the result (5.5). 

As in Eq.(4.34), one can again use this result to obtain a bound on the information of 
the joint probability of position samplings. In this case it is, 

$$ I(X_1,X_2) \ge \ln\left( \frac{2\pi\hbar t}{m\sigma_x^2} \right) \tag{5.7} $$

in the regime $m\sigma_x^2 \ll 2\pi\hbar t$. Similarly, we expect to be able to derive a result of the form 
(4.42), for linear systems, in the exact projections case, although we do not describe this 
in detail. 



VI. GENERAL HISTORIES 

We have studied the uncertainty principle for histories characterized by position and 
momentum projections at two moments of time. We now go on to study the more general 
case of histories characterized by position projections at an arbitrary number of times [38] . 
On general grounds, and inspired by specific calculations [5], we expect the probability for 
a sequence of position samplings to be in some sense peaked about sets of solutions to the 
classical field equations, with a weight depending on the initial state. The precise sense in 
which this is true is discussed in another paper [39] (see also Ref.[40]). One expects the 
uncertainty principle to impose a limitation on the degree of peaking. Here, we derive an 
information-theoretic inequality expressing this limitation for histories characterized by an 
arbitrary number of position samplings. This is a generalization of the results (4.34), (4.42) 
and (5.7). We then obtain the form of the uncertainty principle for histories characterized 
by other types of projections. 

As in Section V, if some of the projections in the chain operators $C_\alpha$ do not commute, 
then the spectrum of the operator $C^\dagger_\alpha C_\alpha$ is strictly less than 1, and likewise the probabilities 
$p(\alpha)$. A lower bound on the information of the form (5.1) is thus obtained. Let us apply 
this rationale to strings of imprecise position projections, with sampling functions of the 
form (3.2). Our aim is to obtain a lower bound on the information 

$$ I(X_1,\cdots,X_n) = -\sum_{\alpha_1}\cdots\sum_{\alpha_n} p(\alpha_1,\cdots,\alpha_n) \, \ln p(\alpha_1,\cdots,\alpha_n) \tag{6.1} $$

The expression (1.3) for the probabilities may be written, 

$$ p(\alpha) = \int dx_0\, dy_0 \; \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle \; \rho(x_0,y_0) \tag{6.2} $$

where 

$$ \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle = \int \prod_{k=1}^{n} dx_k\, dy_k \; \delta(x_n-y_n)\; \Upsilon(x_k-\bar x_k)\,\Upsilon(y_k-\bar x_k) \; \prod_{k=1}^{n} J(x_k,y_k,t_k|x_{k-1},y_{k-1},t_{k-1}) \tag{6.3} $$

Here, as in previous sections, $\bar x_k = \alpha_k\sigma_k$. The sampling functions $\Upsilon$ are given by Eq.(3.2). 
$J$ is the density matrix propagator, which for unitary evolution is given by 

$$ J(x'',y'',t''|x',y',t') = \langle x'',t''|x',t'\rangle \, \langle y'',t''|y',t'\rangle^* \tag{6.4} $$

We shall work in the limit that the time separation between each projection is small. The 
propagators in (6.4) are then given by (4.35), (4.36). This is exact for linear systems. The 
case in which $J$ is a non-unitary reduced density matrix propagator is also of interest in 
the context of decoherence models (see Refs.[5,41], for example). However, such propagators 
reduce to the unitary expression (6.4) in the short time limit, hence our results are 
applicable to that case also. 

For simplicity, we study first the free particle case, for which one has 

$$ A(t'',t') = \left( \frac{m}{2\pi i\hbar (t''-t')} \right)^{1/2} \tag{6.5} $$

and 

$$ S(x'',t''|x',t') = \frac{m(x''-x')^2}{2(t''-t')} \tag{6.6} $$

Also, let all of the projections have the same width $\sigma$, and let the time separation between 
all slits be $t$ (except for $t_1$ and $t_0$, see below). 

We wish to estimate the largest eigenvalue of the operator $C^\dagger_\alpha C_\alpha$. The eigenvalue 
equation is, 

$$ \int dx_0 \; \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle \; u(x_0) = \lambda\, u(y_0) \tag{6.7} $$

The expression (6.3) occurring in (6.7) has the form of a discrete version of a sum over 
histories. It may be regarded as a sum over pairs of paths, starting at $x_0$ and $y_0$, passing 
through gates of width $\sigma$ at times $t_1 \cdots t_n$, and meeting in the final gate at point $x_n$, which 
is integrated over the width $\sigma$. We may approximately evaluate (6.3), and hence solve the 
eigenvalue equation, by looking for the paths which dominate the integral in the regimes 
of interest. 

We follow a heuristic argument previously used by Mensky in a related context [40]. 
There are two competing effects that will determine which paths dominate. On the one 
hand, if the slit widths in the projections are very small, this will force the paths to follow 
the set of alternatives $\bar x_k$ specified by the projections. On the other hand, if the action of 
each path (i.e. the sum of the phases of the propagators) is very large, $S \gg \hbar$, then by 
the stationary phase approximation, we expect the dominant paths to be those extremizing 
the action, i.e., classical paths. 

Consider first the case in which the slit widths are very small. In this case the paths are 
forced to follow the sampling positions $\bar x_k$. The action $S$ of each path is of order $m\sigma^2/t$. 
We therefore take "$\sigma$ small" to mean that $S \ll \hbar$. This implies that the exponential 
part of the propagators in (6.3) is negligible, and only the prefactors contribute. We may 
therefore approximately evaluate the integral (6.3), with the result 

$$ \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle \approx \left( \frac{m}{2\pi\hbar t} \right)^{n-1} (\sigma^2)^{n-2}\, \sigma \int dx_1\, dy_1 \; \Upsilon(x_1-\bar x_1)\,\Upsilon(y_1-\bar x_1) \; J(x_1,y_1,t_1|x_0,y_0,t_0) \tag{6.8} $$

The origin of each part of this expression is as follows: the factor $(m/2\pi\hbar t)^{n-1}$ comes from 
the $(n-1)$ propagators $J$; the factor $(\sigma^2)^{n-2}$ comes from the integrations over $x$ and $y$ at 
times $t_2$ to $t_{n-1}$, noting that $J$ is approximately constant, and recalling Eq.(3.4); the factor 
of $\sigma$ comes from the final integration over $x_n$. The remaining integrations over $J$ in (6.8) 
arise due to the fact that the density matrix in (6.2) is at the initial time $t_0$, and not at the 
time $t_1$ at which the first projection is made. This is merely a notational inconvenience: 
the very last part of the chain operators $C_\alpha$ is an evolution operator from $t_0$ to $t_1$. It is 
readily removed by letting $t_1 \to t_0$; thus $J$ becomes a product of delta-functions and (6.8) 
becomes, 

$$ \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle \approx \left( \frac{m}{2\pi\hbar t} \right)^{n-1} (\sigma^2)^{n-2}\, \sigma \; \Upsilon(x_0-\bar x_1)\,\Upsilon(y_0-\bar x_1) \tag{6.9} $$

Inserting this in the eigenvalue equation (6.7), one thus finds that the spectrum is degenerate, 
with 

$$ \lambda = \left( \frac{m\sigma^2}{2\pi\hbar t} \right)^{n-1} \tag{6.10} $$

The eigenfunctions are functions constant in an interval of size $\sigma$ and zero elsewhere. 
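
The eigenvalue in the small-slit regime can be checked by direct substitution. The sketch below uses only the factors described in the text (one factor $m/2\pi\hbar t$ per propagator pair, $\sigma^2$ per intermediate slit, $\sigma$ for the final slit) and assumes the limit $t_1 \to t_0$, exact window functions $\Upsilon$ of width $\sigma$, and a trial function $u$ equal to 1 on the first slit and 0 elsewhere.

```latex
% Sketch: trial function u(x_0) = \Upsilon(x_0 - \bar x_1), i.e. constant on the first slit of width \sigma.
\begin{align*}
\int dx_0\, \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle\, u(x_0)
 &\approx \left(\frac{m}{2\pi\hbar t}\right)^{n-1} (\sigma^2)^{n-2}\,\sigma\;
    \Upsilon(y_0-\bar x_1) \int dx_0\, \Upsilon(x_0-\bar x_1) \\
 &= \left(\frac{m}{2\pi\hbar t}\right)^{n-1} \sigma^{2(n-1)}\; u(y_0)
  = \left(\frac{m\sigma^2}{2\pi\hbar t}\right)^{n-1} u(y_0) .
\end{align*}
```

Taking $-\ln\lambda$ then gives $(n-1)\ln\left(2\pi\hbar t/m\sigma^2\right)$, which is the form of the lower bound on the information in the small-$\sigma$ regime.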

Next, let the slit widths be very large. The action of each section of path is then 
allowed to be large, and it is the stationary phase effect that will dominate. The dominant 
contribution to the sum over histories will therefore come from the immediate vicinity of 
the classical paths. When the sampling positions are chosen to line up according to the 
classical path, it is as if the projections are not there, since most of the integral comes 
from this regime anyway. It follows that 

$$ \langle y_0|C^\dagger_\alpha C_\alpha|x_0\rangle \approx \delta(x_0 - y_0) \tag{6.11} $$

and we thus find that $\lambda_{\max} \approx 1$. 

Combining these two cases, we thus obtain the following for the lower bound on the 
information of a sequence of position samplings, 

$$ I(X_1,\cdots,X_n) \ge \begin{cases} 0, & \text{if } m\sigma^2 \gg \hbar t; \\ (n-1)\ln\left( \dfrac{2\pi\hbar t}{m\sigma^2} \right), & \text{if } m\sigma^2 \ll \hbar t. \end{cases} \tag{6.12} $$

This lower bound is what one might intuitively expect. First of all, large $\sigma$ is essentially 
the classical regime, in which we do not expect to suffer limitations on our ability to describe 
a history; hence there is no restriction on the information. Secondly, the case of small $\sigma$ 
is essentially (4.34) generalized to an arbitrary number of samplings. We might expect 
it because when $\sigma$ is small, the projectors are almost fine-grained. They "pinch off" the 
probability (6.2): it becomes approximately equal to a product of probabilities for two-time 
histories of the type discussed in Sections IV and V. Indeed, the bound in (6.12) is 
just a sum of bounds of the type (4.34). We will see this in more detail below. 

Generalizations of (6.12) may be obtained. The above analysis is readily generalized 
to the case in which the slit widths $\sigma_j$ and the time separations $(t_{j+1} - t_j)$ are different, 
and the short time propagator is given by the more general expression (4.35). It is then 
straightforward to show that the lower bound in (6.12) is, in the small $\sigma_j$ regime, 

$$ I_{\min} \approx -\sum_{j=1}^{n-1} \ln\left( \sigma_{j+1}\,\sigma_j\, |A(t_{j+1},t_j)|^2 \right) \tag{6.13} $$

(and again $I_{\min} \approx 0$ in the large $\sigma_j$ regime). Eq.(6.13) is the leading order behaviour of 
$I_{\min}$ for small $\sigma_j$, and for small time separations. For linear systems it is valid for arbitrary 
time separations. How are we to understand this expression? 

For the phase space samplings considered earlier, the significance of the lower bounds 
(4.6), (5.5), is intuitively clear: the argument of the logarithm is the ratio of the fundamental 
phase space volume $2\pi\hbar$ to the sampling volume $\sigma_x\sigma_k$. 

The lower bound (6.13) has a rather different form; yet an analogous interpretation 
suggests itself. The propagator prefactor $|A(t_{j+1},t_j)|^2$ has the dimension of (length)$^{-2}$ 
and is commonly regarded as the "density of paths". Introduce the quantity, 

$$ V_H = \prod_{j=1}^{n-1} |A(t_{j+1},t_j)|^{-2} \tag{6.14} $$

for $n = 2, 3, \cdots$. For the case of position samplings it has the dimension (length)$^{2(n-1)}$. 
It might therefore reasonably be regarded as the fundamental "history space volume". 
Eq.(6.13) may then be written in the suggestive form, 

$$ I_{\min} \approx \ln\left( \frac{V_H}{V_S} \right) \tag{6.15} $$

where $V_S = \prod_{j=1}^{n-1} \sigma_{j+1}\sigma_j$ is the sampling volume. Eq.(6.15) now has exactly the same 
structure as the information-theoretic bounds (4.6), (5.5), on the phase space samplings 
considered earlier: the argument of the logarithm in (6.15) is the ratio of the fundamental 
history space volume to the sampling volume. 

It is natural to ask how the results of this section might be further generalized to 
histories characterized by samplings of variables other than position. It is actually not 
difficult to see that the above results generalize to histories characterized by samplings 
of any continuous quantity, such as momentum, angular momentum, etc. Let $P_\alpha$ be an 
imprecise sampling of some continuous quantity $a$: 

$$ P_\alpha = \int da \; \Upsilon(a - \bar a_\alpha) \; |a\rangle\langle a| \tag{6.16} $$

The projections partition the variable $a$ into bins of size $\sigma$ labeled by $\alpha$. We may take the 
projections to be onto different variables at each moment of time. In the limit of small 
widths, it is not difficult to see that the analysis for the position samplings case described 
above readily goes over to the case of arbitrary continuous variables $a_k$. Essentially what 
happens is that in the small $\sigma_j$ limit, the matrix elements of the operator $C^\dagger_\alpha C_\alpha$ become 
products of propagators and slit widths, in analogy with Eq.(6.8). More precisely, 

$$ \langle a_0'|C^\dagger_\alpha C_\alpha|a_0\rangle \approx \frac{1}{\sigma_1} \prod_{j=1}^{n-1} \sigma_{j+1}\,\sigma_j\, |A(t_{j+1},t_j)|^2 \int_{\sigma_1} da_1 \int_{\sigma_1} da_1' \; \langle a_1,t_1|a_0,t_0\rangle \, \langle a_1',t_1|a_0',t_0\rangle^* \tag{6.17} $$

We therefore again deduce the lower bound on the uncertainty (6.15), for this much more 
general class of histories. The factors $A(t_{j+1},t_j)$ in (6.14) are now identified with the short 
time limit of the propagators $\langle a_{j+1},t_{j+1}|a_j,t_j\rangle$ (maximized over the alternatives $a_{j+1}$, $a_j$, 
in the event that the propagator depends on them in the short time limit). We may thus 
write 

$$ I(A_1,A_2,\cdots,A_n) \ge I_{\min} \approx \ln\left( \frac{V_H}{V_S} \right) \tag{6.18} $$

in the small $\sigma_j$ regime. Here, $A_1,\cdots,A_n$ denotes a string of alternatives which can be 
any continuous variables, and may be different variables at different times. 

Let us test this more general result with a simple case. Consider a history characterized 
by a position projection at time $t_1$ and a momentum projection at time $t_2$. Thus $\sigma_1 = \sigma_x$ 
and $\sigma_2 = \sigma_k$. The short time propagator is 

$$ \langle p,t_2|x,t_1\rangle \approx \frac{1}{(2\pi\hbar)^{1/2}} \exp\left( -\frac{i}{\hbar}\frac{p^2}{2m}(t_2-t_1) - \frac{i}{\hbar}px \right) \tag{6.19} $$

The history space volume is therefore $V_H = |A(t_2,t_1)|^{-2} = 2\pi\hbar$. The history space 
volume element is not just analogous to the factor of $2\pi\hbar$ for phase space samplings: it 
is equal to it in this case. Moreover, the general result (6.18) coincides exactly with the 
expected result (5.5) for phase space samplings. 
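
The history-space form of the bound is simple enough to evaluate directly. The following sketch computes $V_H$ from a list of prefactors $|A(t_{j+1},t_j)|^2$ and compares it with a sampling volume built from the slit widths. The definition $V_S = \prod_{j=1}^{n-1}\sigma_{j+1}\sigma_j$ is inferred here from the small-width sum over $\ln(\sigma_{j+1}\sigma_j|A|^2)$ and is an assumption of the example, as are the function names and the units $\hbar = 1$.

```python
import math

def history_space_volume(prefactors_sq):
    """V_H = prod_j |A(t_{j+1}, t_j)|^{-2}; the input is the list of |A|^2 values."""
    return math.prod(1.0 / a2 for a2 in prefactors_sq)

def sampling_volume(widths):
    """V_S = prod_{j=1}^{n-1} sigma_{j+1} sigma_j (assumed definition, see lead-in)."""
    return math.prod(widths[j + 1] * widths[j] for j in range(len(widths) - 1))

def information_bound(prefactors_sq, widths):
    """I_min ~ ln(V_H / V_S), meaningful only in the small-width regime."""
    return math.log(history_space_volume(prefactors_sq) / sampling_volume(widths))

def free_particle_prefactor_sq(m, t, hbar=1.0):
    """|A|^2 = m / (2 pi hbar t) for the free-particle propagator."""
    return m / (2 * math.pi * hbar * t)
```

For $n$ equal-width position samplings of a free particle this reproduces $(n-1)\ln(2\pi\hbar t/m\sigma^2)$, and for a position sampling followed by a momentum sampling, with $|A|^2 = 1/2\pi\hbar$, it returns $\ln(2\pi\hbar/\sigma_x\sigma_k)$.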

Eq.(6.18) is the main result of this paper: a concise and very general expression of 
the uncertainty principle, expressed in the language of quantum-mechanical histories, not 
referring in any way to phase space but reducing to the phase space form in the appropriate 
circumstances. 

The expression of the uncertainty principle (6.18) refers to a fundamental history space 
volume $V_H$. It is obtained in (6.14) from the short time behaviour of the propagator, and is 
thus uniquely determined given the unitary evolution operator, $e^{-\frac{i}{\hbar}Ht}$. That this operator 
should appear in the statement of the uncertainty principle for histories should come as 
no surprise. Unlike phase space statements, the description of a history depends on both 
the projection operators at each moment of time and the unitary evolution between them. 

Of course, we have not defined the "history space" of which $V_H$ is the volume element. 
We shall not pursue this question here, except to note that it appears to be related to the 
Cartesian product space $s_1 \times s_2 \times \cdots \times s_n$, where $s_j$ is the spectrum of the observable projected 
at time $t_j$. This has been discussed by Omnès [42]. It is also perhaps interesting to note 
that the existence and relevance of such a space is indicated by the form of the uncertainty 
relation (6.18). 

VII. DISCUSSION 

In this paper, we addressed a simple question: How is the uncertainty principle encoded 
in the probabilities for histories, Eq.(1.3)? A simple but very general answer is offered: it 
arises as the lower bound on the Shannon information, Eq.(6.18). 

We have stressed the generality of the lower bound (6.18) within the framework of 
standard quantum mechanics (or at least, its modest generalization to histories). Yet 
the information-theoretic approach employed here has a potentially greater degree of 
generality. Information as a measure of uncertainty depends solely on the probabilities for 
histories. This is in contrast to the usual variance form of the uncertainty principle, (1.6), 
which depends on the wave function of the system at a fixed moment of time. The gener- 
ality of the information-theoretic form suggests that it might survive to broader forms of 
quantum mechanics, such as the generalized quantum mechanics suggested by Hartle [2], 
which attempts to get away from the Hilbert space formulation. For even if a formulation 
of quantum mechanics does not deal with wave functions, it must deal with probabili- 
ties: information-theoretic measures may therefore exist where Hilbert space-dependent 
measures do not. 

To be more precise, we conjecture that the uncertainty principle will most generally 
arise as a lower bound on the information, of the form (1.8), even in generalized formu- 
lations of quantum mechanics in which a statement in terms of variances is not available. 
A stronger conjecture is that the general form of the lower bound (6.18) will also survive 
such generalizations. These are, however, difficult issues to address in the absence of a 
concrete generalization of quantum mechanics. They will be taken up elsewhere. 